Systolic architectures for connected speech recognition

Similar resources

Hybrid architectures for speech recognition

State-of-the-art automatic speech recognition (ASR) systems use a statistical pattern recognition framework called HMM/GMM (Hidden Markov Model / Gaussian Mixture Model) with short-time spectral features such as Mel Frequency Cepstral Coefficients (MFCC) or Perceptual Linear Prediction (PLP). Although this approach has been shown to be effective in capturing speech patterns, recent perf...

A Systolic FPGA Architecture of Two-Level Dynamic Programming for Connected Speech Recognition

In this paper, we present an efficient architecture for connected word recognition that can be implemented on a field-programmable gate array (FPGA). The architecture consists of a newly derived two-level dynamic programming (TLDP) scheme that uses only bit addition and shift operations. The advantages of this architecture are the spatial efficiency to accommodate more words with limited space and the ab...

Mandarin connected digits recognition for whispered speech

In this paper, the acoustic characteristics and recognition of whispered speech are discussed. A Mandarin digits database is built in both normal and whispered speech. The collected normal and whispered speech materials are analyzed to verify the characteristics of, and differences between, the two kinds of speech. Cross recognition is carried out using normal and whispered speech as tr...

Hybrid SVM/HMM architectures for speech recognition

In this paper, we describe the use of a powerful machine learning scheme, Support Vector Machines (SVM), within the framework of hidden Markov model (HMM) based speech recognition. The hybrid SVM/HMM system has been developed based on our public domain toolkit. The hybrid system has been evaluated on the OGI Alphadigits corpus and performs at 11.6% WER, as compared to 12.7% with a triphone mixt...

Speech Recognition Architectures for Multimedia Environments

Computer workstations have recently become powerful enough to support speech recognition entirely in software, but speech recognizers still vary in their functionality, and each vendor offers their own programmatic interface. Developing recognition applications currently means writing to non-portable protocols. As new improved recognizers become available, such applications will need to be rewr...

Journal

Journal title: IEEE Transactions on Acoustics, Speech, and Signal Processing

Year: 1986

ISSN: 0096-3518

DOI: 10.1109/tassp.1986.1164918